Discovering Context-Topic Rules in Search Engine Logs
Abstract
We present a class of rules, called context-topic rules, for discovering associations between topics and contexts, where a context is a set of features that can be extracted from the log file of a Web search engine. We introduce a notion of rule interestingness that measures how relevant a topic is within a context, and provide an algorithm that computes concise representations of the interesting context-topic rules. Finally, we present the results of applying the proposed methodology to a large search engine log. Joint work with Mark Levene.
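To make the idea of a context-topic rule concrete, here is a minimal sketch. The abstract does not define its interestingness measure, so this example substitutes lift, P(topic | context) / P(topic), as a stand-in. The log records, the feature names (`period`, `day`), the function `context_topic_lift`, and the `min_support` threshold are all hypothetical, and the sketch does not attempt the concise representations mentioned above.

```python
from collections import Counter
from itertools import combinations

# Hypothetical log records: (context features, topic). Not data from the paper.
LOG = [
    ({"period": "evening", "day": "weekend"}, "entertainment"),
    ({"period": "evening", "day": "weekend"}, "entertainment"),
    ({"period": "evening", "day": "weekday"}, "entertainment"),
    ({"period": "morning", "day": "weekday"}, "news"),
    ({"period": "morning", "day": "weekday"}, "news"),
    ({"period": "morning", "day": "weekend"}, "news"),
    ({"period": "evening", "day": "weekday"}, "news"),
    ({"period": "morning", "day": "weekday"}, "shopping"),
]


def context_topic_lift(log, min_support=2):
    """Score rules context -> topic by lift (an assumed stand-in measure).

    A context is any non-empty subset of a record's (feature, value) pairs.
    Rules seen fewer than min_support times are dropped.
    """
    n = len(log)
    topic_counts = Counter(topic for _, topic in log)
    context_counts = Counter()
    pair_counts = Counter()
    for features, topic in log:
        items = sorted(features.items())
        # Enumerate every non-empty feature subset as a candidate context.
        for size in range(1, len(items) + 1):
            for subset in combinations(items, size):
                context_counts[subset] += 1
                pair_counts[(subset, topic)] += 1
    rules = {}
    for (context, topic), count in pair_counts.items():
        if count < min_support:
            continue
        confidence = count / context_counts[context]  # P(topic | context)
        prior = topic_counts[topic] / n               # P(topic)
        rules[(context, topic)] = confidence / prior
    return rules


if __name__ == "__main__":
    ranked = sorted(context_topic_lift(LOG).items(), key=lambda kv: -kv[1])
    for (context, topic), lift in ranked:
        print(dict(context), "->", topic, round(lift, 2))
```

A lift above 1 means the topic is more frequent inside the context than in the log overall. Enumerating every feature subset is exponential in the number of features, so this is only workable for the handful of features used here.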
Similar resources
Web User Search Pattern Analysis for Modeling Query Topic Changes
Web search engine logs are a good source of information for Web user modeling in which user session analysis is often incurred. However, studies on Web logs assume a user session to cover the complete time period of the data set. In the absence of any further information, we define a user session to be related to the user search topics. Viewing sessions in this way can help overcome problems du...
Position Paper: Access to Query Logs – An Academic Researcher’s Point of View
Academic researchers have very limited access to query logs of major web search engines. Studying and analyzing large-scale query logs is essential for advancing Web IR. We propose setting up review boards with clear rules for appropriate conduct, and allowing researchers access to logs within this framework.
Query Topic Classification and Sociology of Web Query Logs
In the paper, the objects, tasks, and a general procedure of the sociological analysis of Web search engine query logs are described and illustrated by a methodologically complete study of the cross-nation search image changes based on two-year spaced query logs of the national search audience.
Mining Low-Risk Rules for Altering Query Terms from Large-Scale Logs of Query Reformulations
A widely-used method that Web search engines use to improve relevance is automatically altering terms in the user's query, in order to overcome potential vocabulary mismatches between the query and relevant Web pages. In commercial search engines, a large percentage of all queries are altered in some way. While such query alteration has significant upside potential to improve relevance for many...
Learning Rewrite Rules for Search Database Systems Using Query Logs
Recent literature on “search database systems” has introduced the notion of using query rewrite rules to influence the behavior of a search engine. Rewrite rules enable domain experts and search administrators to customize the search engine by providing a powerful rule-driven framework to transform user search queries. In this paper, we address the important problem of automatically learning su...
Publication date: 2006